O'Bryen, Barbara • 



.t 



From: 

Sent: 

To: 

Subject: 



Goldberg, Jeanine 

Tuesday, April 02, 2002 7:34 AM 

O'Bryen, Barbara 

09/762,724- MSG Pneumocystis carinii 




Please place results on DISK. 

1 . Please search SEQ ID NO: 17-20, 23, 24 

2. Please search SEQ ID NO: 13, positions 2821-3072 

3. Please search SEQ ID NO: 3, positions 2758-3006 

4. Please search SEQ ID NO: 1 , positions 2894-3042 

5. Please search SEQ ID NO: 5, posigions 2845-3090 

THANK YOU 
Jeanine 

Jeanine Enewold Goldberg 
1634 

CM1--12D11 
Mailbox- 12E12 
306-5817 



Point of Contact: 
Barb O'Bryen 
Technical Information Specialist 
STIC CM1 6A05 308-4291 



A 5 ^ 



r 



SEQ ID NO: 
RESULT 5 
PMCMSGI 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



mat_peptide 



gene 



CDS 



13, nucleotides 2821-3072 



PMCMSGI 3363 bp DNA PLN 26-SEP-1994 

Pneumocystis carinii B-cell receptor (msgl) gene, 3' end. 
L27092 

L27092.1 GI:535706 
B-cell receptor. 
Pneumocystis carinii DNA. 
Pneumocystis carinii 

Eukaryota ; Fungi ; Ascomycota ; Pneumocys tidomycetes ; 
Pneumocystidaceae; Pneumocystis. 
1 (bases 1 to 3363) 
Garbe,T.R. and Stringer, J. R. 

Molecular characterization of clustered variants of genes encoding 
major surface antigens of human Pneumocystis carinii 
Infect. Immun. 62 (8), 3092-3101 (1994) 
94314421 

Location/Qualifiers 
1. .3363 

/organism= M Pneumocystis carinii" 
/db_xref="taxon: 4754" 
152. .3241 
/gene="msgl " 

/standard_name=" major surface antigen" 
/evidence=experimental 
/product-"B-cell receptor" 
152. .3244 
/gene="msgl" 
<152. .3244 
/ gene="msgl" 
/codon_start=l 
/product="B-cell receptor" 
/protein_id="AAA21645. 1" 
/db_xref="GI: 535707" 

/translation="VARAVKRQVAGVKNNEAEERLFALIMRADYKDESKCKNKIKEYC 
DGLKNASLTSEEVHKELKDFCKDGSQGKKCEELKKNVEAKCNNFKTKLEGLVKKDASG 
LTNDDCKENERQCLFLEGACPDLVEDCSKLRNLCYQKKREGVAEEVLLRALRGDLGNK 
TECEKKIKDVCPKIGQESDELTLLCLDQKKTCTNLMTARDKKCNTLEEDVKKALENKN 
NLLGKCLPLLEHAT FTEGTAKKASQCTPNKDCEDYLPKCDELAEECGKKGIIYIHPGP 
DFDPTKPEPTVAEDIGLEELYKKAAEDGVHIGKPPVRDATALLALLIQNPDPKIQANE 
KEKCKKVLENKCKELKKHEVLGDLCNQNAASQSGTKKCEELEKELANSTKI LSEKI KN 
KHLSGSGETIPWYKLSTFLSDSDCARLESDCFYFAQDKDPLKKECKNVKAACYKRGLD 
ARANKVLQENMRGLLRGSNQSWLKKFQQELVKVCEKLKEENKGSFSNDELFVLCVQPA 
KAARLLTHDLRMTTIFLRQQLDQKRDFPTVKTAKGIREKCQDLGKGFQKEITWPCHTL 
EQQCNRLGTTEILKQVLLNEHKDTLKTHENCVTYLKEKCNKWSRRGDDRFSFVCVFQN 
ATCKLMVKDVQDRCKIFKENIKVSEIVDFLKNNTNNITTLERNCPSWHTYCNRFSSNC 
PDFSKKNPCTKIKNNCKPFYERKALEDALKVELRGKLSDENKCTAALKGYCTLAGNW 
NASVRSLCKDNTQGSNKKTDEKWEELCKKLMEEVKEQCETLPAELKQPADDLEKDVK 
TYEELKEEAKKAMNKSSLVLSFVKKDGNNTPKNNSKSEDKNWSNEKDTIKHVKILRR 
GVKDVLVTELEAKAFDLAAEVFGRYVDLKERCEKLTLDCGIKDDCDGLKGVCGKIKKK 
CRDLKPLEVKSHEIVTESTTTTTTTTTTVTDPKATECKSLQTTDTWVTQTSTHTSTST 
ITSTITSKITLTSTRRCKPTKCTTGDDAEDVKPSEGLRVSGWNVMRGAIVAMVISFMI 



BASE COUNT 
ORIGIN 



1312 a 



505 c 753 g 



793 t 



Query Match 80.5%; Score 202.8; DB 8; 

Best Local Similarity 90.9%; Pred. No. 2.8e-47; 
Matches 229; Conservative 0; Mismatches 17; 



Length 3363; 
Indels 6; Gaps 



Qy 1 gaatgcaaatccttacagacaacagacacatgggttacacagacatcgacacacacaagc 60 

I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2999 GAAT G C AAAT C CT T AC AGAC AACAGAC AC AT GGGT T ACAC AGAC AT C G AC ACAC AC AAG C 3058 

Qy 61 acgtctactatcacatctaccatcacatcaaaaataacattgacatcaacgaggcgatgc 120 

II I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I II I I I I I III 

Db 3059 AC AT CT AC CAT C ACAT CT AC GAT T AC AT C AAAAAT AAC AT T GACAT C AAC AAGG C G GT G C 3118 

Qy 121 aaaccaaccaagtgtacgacaggagaggaagatgatgcaggagacgtgaaaccgagtgag 180 

I I I I I I I I I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I II I I I I I 

Db 3119 AAAC CAAC CAAGT GT AC GAC AGG GGAT GAT G CAGAAGAC GT GAAG C CAAGT GAA 3172 

Qy 181 gggctgaggatgagtgggtggaatgtgatgaggggggtgatagtagcaatggttatttcg 240 

II I I I I I MM I I I I I II II I I II II II I I II I II I I II I II II M II II I II I 

Db 3173 GGCTTGAGGGTGAGCGGGTGGAATGTGATGAGGGGGGCAATAGTAGCAATGGTTATTTCG 3232 

Qy 241 ttcatgatttag 252 

II II M I I II I I 
Db 3233 TTCATGATTTAG 3244 



SEQ ID NO: 
RESULT 7 
PMCMSGI 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



3, nucleotides 2758-3006 



PLN 

receptor (msgl) 



gene, 



26-SEP- 
3' end 



-1994 



mat_peptide 



gene 



CDS 



PMCMSGI 3363 bp DNA 

Pneumocystis carinii B-cell 
L27092 

L27092. 1 GI:535706 
B-cell receptor. 
Pneumocystis carinii DNA. 
Pneumocystis carinii 

Eukaryota; Fungi ; Ascomycota; Pneumocystidomycetes ; 
Pneumocystidaceae; Pneumocystis . 
1 (bases 1 to 3363) 
Garbe,T.R. and Stringer, J. R. 

Molecular characterization of clustered variants of genes encoding 
major surface antigens of human Pneumocystis carinii 
Infect. Immun. 62 (8), 3092-3101 (1994) 
94314421 

Location/Qualifiers 
' 1. .3363 

/organism^" Pneumocystis carinii" 
/db_xref="taxon: 4754" 
152. .3241 
/gene="msgl" 

/standard__name="ma jor surface antigen" 
/evidence=experimental 
/product=" B-cell receptor" 
152. .3244 
/gene-"msgl" 
<152. .3244 
/ gene="msgl " 
/codon_start-l 
/product^" B-cell receptor" 
/protein_id="AAA21645. 1" 
/db_xref="GI: 535707" 

/trans la tion="VARAVKRQVAGVKNNEAEERLFALIMRADYKDESKCKNKIKEYC 
DGLKNASLTSEEVHKELKDFCKDGSQGKKCEELKKNVEAKCNNFKTKLEGLVKKDASG 
LTNDDCKENERQCLFLEGACPDLVEDCSKLRNLCYQKKREGVAEEVLLRALRGDLGNK 
TECEKKIKDVCPKIGQESDELTLLCLDQKKTCTNLMTARDKKCNTLEEDVKKALENKN 
NLLGKCLPLLEHATFTEGTAKKASQCTPNKDCEDYLPKCDELAEECGKKGIIYIHPGP 
DFDPTKPEPTVAEDIGLEELYKKAAEDGVHIGKPPVRDATALLALLIQNPDPKIQAInIE 
KEKCKKVLENKCKELKKHEVLGDLCNQNAASQSGTKKCEELEKELANSTKILSEKIKN 
KHLSGSGETIPWYKLSTFLSDSDCARLESDCFYFAQDKDPLKKECKNVKAACYKRGLD 
ARANKVLQENMRGLLRGSNQSWLKKFQQELVKVCEKLKEENKGSFSNDELFVLCVQPA 
KAARLLTHDLRMTTIFLRQQLDQKRDFPTVKTAKGIREKCQDLGKGFQKEITWPCHTL 
EQQCNRLGTTEILKQVLLNEHKDTLKTHENCVTYLKEKCNKWSRRGDDRFSFVCVFQN 
ATCKLMVKDVQDRCKIFKENIKVSEIVDFLKNNTNNITTLERNCPSWHTYCNRFSSNC 
PDFSKKNPCTKIKNNCKPFYERKALEDALKVELRGKLSDENKCTAALKGYCTLAGNW 
NASVRSLCKDNTQGSNKKTDEKWEELCKKLMEEVKEQCETLPAELKQPADDLEKDVK 
TYEELKEEAKKAMNKSSLVLSFVKKDGNNTPKNNSKSEDKNWSNEKDTIKHVKILRR 
GVKDVLVTELEAKAFDLAAEVFGRYVDLKERCEKLTLDCGI KDDCDGLKGVCGKI KKK 
CRDLKPLEVKSHEIVTESTTTTTTTTTTWDPKATECKSLQTTDTWVTQTSTHTSTST 
ITSTITSKITLTSTRRCKPTKCTTGDDAEDVKPSEGLRVSGWNVMRGAIVAMVTSFMI 



BASE COUNT 
ORIGIN 



1312 a 



505 c 



753 g 793 t 



Query Match 61.6%; Score 153.4; DB 8; Length 3363; 

Best Local Similarity 79.1%; Pred. No. 1.2e-33; 

Matches 197; Conservative 0; Mismatches 46; Indels 6; Gaps 

Qy 1 gactgccactctttacagacaacagatacgtgggtcacaaagacgtcgacccatactagc 60 

II Ml I II I I II I I I I I I I I I I II I I I I I II I I I I I I I I I I II II III 
Db 2 999 GAAT G CAAAT C CT TACAGACAACAGACACAT GGGT TACACAGACAT C GACACACACAAG C 3058 

Qy 61 acatccacaaccacatctacagtcacgtcaagaataacgttgacctcgacaagacggtgt 120 

II I I I II I I I I I I I I I I I II II I I I I I I I I I I II I II Mill I I I I I 
Db 3059 ACAT CT AC CAT CACAT CTAC GAT T ACAT CAAAAAT AACAT T GACAT CAACAAG GC GGT G C 3118 

Qy 121 aagcctacgaagtgtacgacaggagaggaagatgaagcaggagacgtgaaaccgagtgaa 180 

II II II II I II I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I 

Db 3119 AAAC C AAC C AAGT GT AC G AC AG G GGAT GAT GCAGAAGACGT GAAGCCAAGT GAA 3172 

Qy 181 gggttgaggatgagtggatggagtgtgatgaggggggtgttattagcaatgacgatttca 240 

I I I I I II I I I I I II I I I I II I I I I I I I I I II I II I I I I I I I I I II I I 
Db 3173 GGCTTGAGGGTGAGCGGGTGGAATGTGATGAGGGGGGCAATAGTAGCAATGGTTATTTCG 3232 

Qy 241 ttcatgatt 249 

I II I I I I I I 
Db 3233 TTCATGATT 3241 



SEQ ID NO: 
RESULT 7 
PMCMSGI 
LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



1, nucleotides 2894-3042 



PMCMSGI 3363 bp DNA PLN 26-SEP-1994 

Pneumocystis carinii B-cell receptor (msgl) gene, 3' end. 
L27092 

L27092.1 GI: 535706 
B-cell receptor. 
Pneumocystis carinii DNA. 
Pneumocystis carinii 
Eukaryota; Fungi; Ascomycota; Pneumocystidomycetes ; 
Pneumocystidaceae; Pneumocystis . 
REFERENCE 1 (bases 1 to 3363) 

AUTHORS Garbe,T.R. and Stringer , J . R. 

TITLE Molecular characterization of clustered variants of genes encoding 

major surface antigens of human Pneumocystis carinii 
JOURNAL Infect. Immun . 62 (8), 3092-3101 (1994) 
MEDLINE 94314421 
FEATURES Location/Qualifiers 
source 1. .3363 

/organism^" Pneumocystis carinii" 
/db_xref= M taxon:4754" 
mat_peptide 152. .3241 

/gene="msgl " 

/standard_name="ma jor surface antigen" 

/evidence=experimental 

/product="B-cell receptor" 
gene 152. .3244 

/ gene= n msgl u 
CDS <152. .3244 

/gene= lf msgl " 

/codon_start-l 

/product=" B-cell receptor" 

/protein_id="AAA21645. 1" 

/db_xref="GI: 535707" 

/trans la tion="VARAVKRQVAGVKNNEAEERLFALIMRADYKDESKCKNKIKEYC 
DGLKNASLTSEEVHKELKDFCKDGSQGKKCEELKKNVEAKCNNFKTKLEGLVKKDASG 
LTNDDCKENERQCLFLEGACPDLVEDCSKLRNLCYQKKREGVAEEVLLRALRGDLGNK 
TECEKKIKDVCPKIGQESDELTLLCLDQKKTCTNLMTARDKKCNTLEEDVKKALENKN 
NLLGKCLPLLEHATFTEGTAKKASQCTPNKDCEDYLPKCDELAEECGKKGIIYIHPGP 
DFDPTKPEPTVAEDIGLEELYKKAAEDGVHIGKPPVRDATALLALLIQNPDPKIQANE 
KEKCKKVLENKCKELKKHEVLGDLCNQNAASQSGTKKCEELEKELANSTKI LSEKI KN 
KHLSGSGETIPWYKLSTFLSDSDCARLESDCFYFAQDKDPLKKECKNVKAACYKRGLD 
ARANKVLQENMRGLLRGSNQSWLKKFQQELVKVCEKLKEENKGSFSNDELFVLCVQPA 
KAARLLTHDLRMTTIFLRQQLDQKRDFPTVKTAKGIREKCQDLGKGFQKEITWPCHTL 
EQQCNRLGTTEILKQVLLNEHKDTLKTHENCVTYLKEKCNKWSRRGDDRFSFVCVFQN 
ATCKLMVKDVQDRCKIFKENIKVSEIVDFLKNNTNNITTLERNCPSWHTYCNRFSSNC 
PDFSKKNPCTKIKNNCKPFYERKALEDALKVELRGKLSDENKCTAALKGYCTLAGNW 
NASVRSLCKDNTQGSNKKTDEKWEELCKKLMEEVKEQCETLPAELKQPADDLEKDVK 
TYEELKEEAKKAMNKSSLVLSFVKKDGNNTPKNNSKSEDKNWSNEKDTIKHVKILRR 
GVKDVLVTELEAKAFDLAAEVFGRYVDLKERCEKLTLDCGIKDDCDGLKGVCGKIKKK 
CRDLKPLEVKSHEIVTESTTTTTTTTTTVTDPKATECKSLQTTDTWVTQTSTHTSTST 
ITSTITSKITLTSTRRCKPTKCTTGDDAEDVKPSEGLRVSGWNVMRGAIVAMVISFMI 



BASE COUNT 
ORIGIN 



1312 a 



505 c 



753 g 793 t 



Query Match 57.3%; Score 85.4; DB 8; Length 3363; 

Best Local Similarity 78.5%; Pred. No. 4.5e-14; 

Matches 117; Conservative 0; Mismatches 26; Indels 6; Gaps 



Qy 


1 


tgacctcgacgaggcggtgtaagcctacgaagtgtacgacaggagaggaagatgaagcag 
1 1 1 1 II II 1 1 1 1 1 1 1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 MM 


60 


Db 


3099 


T G AC AT C AAC AAG G C G G T G C AAAC C AAC C AAG T G T AC GAC AG G GG AT GAT G CAG 


3152 


Qy 


61 


gagaggtgaagccgagtgaggggctgaggatgagtgggtggagtgtgatgagaggggtgt 
III M II II II 1 M II II II II 1 MM II 1 1 M 1 M II II 1 II MM 

AAGACGTGAAGCCAAGTGAAGGCTTGAGGGTGAGCGGGTGGAATGTGATGAGGGGGGCAA 


120 


Db 


3153 


3212 


Qy 


121 


tattagcaatgatgatttcattcatgatt 14 9 

II 1 M II II 1 1 II M 1 II II II 1 II 

T AGT AG C AAT G GT T AT T T C GT T CAT GAT T 3241 




Db 


3213 





SEQ ID NO: 
RESULT 1 
AF033208 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 



FEATURES 

source 



5, 2845-3090 nucleotides 



AF033208 3089 bp 

Pneumocystis carinii f. 
glycoprotein (MSG) gene, 
AF033208 

AF033208.1 GI:3560512 



DNA PLN 10-SEP-1998 

sp. hominis clone HUMSG11 major surface 
partial cds . 



Pneumocystis carinii f. 
Pneumocystis carinii f. 



gene 



CDS 



sp. hominis. 
sp. hominis 

Eukaryota; Fungi; Ascomycota; Pneumocystidomycetes ; 
Pneumocystidaceae; Pneumocystis . 

1 (bases 1 to 3089) 
Garbe,T.R. and Stringer, J. R. 

Molecular characterization of clustered variants of genes encoding 
major surface antigens of human Pneumocystis carinii 
Infect. Immun. 62 (8), 3092-3101 (1994) 
94314421 

2 (bases 1 to 3089) 

Mei,Q., Turner, R.E., Serial, V., Klivington, D . , Angus, C.W. and 
Kovacs, J . A. 

Characterization of major surface glycoprotein genes of human 
Pneumocystis carinii and high-level expression of a conserved 
region 

Infect. Immun. 66 (9), 4268-4273 (1998) 
98380374 

3 (bases 1 to 3089) 

Mei,Q., Turner, R. , Sorial,V., Klivington, D . , Angus, C.W. and 
Kovacs , J . A. 
Direct Submission 

Submitted (07-NOV-1997) Critical Care Medicine Dept., National 
Institutes of Health, 10 Center Drive, MSC 1662, Bethesda, MD 
20892-1662, USA 

Location/Qualifiers 

1. .3089 

/organism= "Pneumocystis carinii f. sp. hominis" 

/sub__species=" hominis" 

/specif ic_host= "Homo sapiens" 

/ db_xr e f = " t axon = 42068" 

/clone= M HUMSGll" 

/note= M derived from HIV-infected human with P. carinii 

pneumonia" 

<1. .3089 

/gene="MSG" 

<1. .3089 

/gene="MSG" 

/note=" surface antigen" 
/ codon_start=3 

/product= "major surface glycoprotein" 
/protein_id= "AAC34971 . 1" 
/db_xref= n GI :3560513" 

/translations "ARAVKRRAKGAQNSIDEEHVLALILKKNGLEDTKCKTKLEEYCK 
TLTNAGLNPEKVHEKLKDFCDNGKRNEKCQDLKNKVNQKCIKFQGKLQTAAGKKISEL 
TDEDCKKNEQQCLFLEGACPTELKDDCNKLRNNCYQKERNNVAEEVLLRALRGDLNET 
KTCEKKLKEVCPKLERESDELTELCLYQKTTCVSLVTKGKSKCDTLEKEVEEALKKNE 
LREKCLLLLEQCYFHRGNCEGDKSKCNKPNNKDCKEYVPECDELAEKCGKENIVYMHP 
GSDFDPTKPEPTLAEDIGLEELYKRAEEDGIFVGRQHVRDATALLALLLKKTLKKEEC 
IKALKKNCENPHEHEALENLCKENKPSSDGTKKCDELEKDVNKTCTSLTSTILKNRLY 
ISPDGIAEWGKLPTFLSDEDCAKLESYCFYYKETCPDVKEACMNVRAACYKRGLDARA 
NSVLQKNMRGLLHGSNKDWLKKFQQELAKVCEKLKGNKGSFSNDELFVLCIQPAKAAR 
LLTHHHQMRVIFLRQQLDQKRDFPTDKDCKELGRKCQDLGKDSKEITWPCHTLEQQCN 
RLGITEILKQILLDEHKDTLKSHENCAKYLKRKCHKWSRRGDDRFSFVCVFQNATCEL 



MVKDVQDRCKIFEENMQASDINDSLKKNQIKAESAANICPSWHPYCDRFLPNCPDLKK 
GKTFCQNLKKYCEPFYKRKVLEDALKVELRGNLSNITKCEPALERYCTVLKDVNNASI 
SSLCKDNTESKTKKADNKNVRKKLCLKLVEEVEQQCKVLPTELTELEKSLKKDVKTYE 
E LKERAKKAMNKS S LVLS LVKKNE SNTS KNNS KNKDKNWSNGLQDTTKYVKI LRRGV 
KEALVTESEAKAFDLAAEVFGRYVDLKEKCEKLTSDCGIKDDCDGLKEVCGKIEKTCH 
DLKPLEVKSHEIVTESTTTTTTTTTTVTDPKATECKSLQTTDTWVTQTSTHTSTSTIT 
STITSKITLTSTRRCKPTKCTTGDEAGDVKPSEGLKMSGWSVMRGVIVAMVTSFMI " 

BASE COUNT 1254 a 447 c 662 g 726 t 

ORIGIN 



Query Match 100.0%; Score 246; DB 8; Length 3 0 89; 

Best Local Similarity 100.0%; Pred. No. 1.3e-57; 

Matches 246; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 gaatgcaaatccttacagacaacagatacatgggttacacagacatcgacacacacaagc 60 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 2 844 GAATGCAAATCCTTACAGACAACAGATACATGGGTTACACAGACATCGACACACACAAGC 2903 
Qy 61 acgtctaccatcacatctaccatcacatcaaaaataacattgacatcaacgaggcgatgc 120 

1 1 1 1 1 1 , 1 , 1 1 ! I ' I! 1 1 1 1 M , 1 . 1 1 1 1 1 1 ^ I 1 1 ! 1 , 1 1 1 . 1 1 II 1 1 1 1 , 1 1 1 M ! 

Db 2904 ACGTCTACCATCACATCTACCATCACATCAAAAATAACATTGACATCAACGAGGCGATGC 2963 
Qy 121 aaaccaaccaagtgtacgacaggggatgaagcaggagacgtgaaaccgagtgagggattg 180 

II I II 1 1 III II II III I III II 1 1 II I II 1 1 II 1 1 II Ml 1 1 II 1 1 II 1 1 II II 1 1 II I 

Db 2 964 AAACCAACCAAGTGTACGACAGGGGATGAAGCAGGAGACGTGAAACCGAGTGAGGGATTG 3023 
Qy 181 aagatgagtgggtggagcgtgatgaggggggtgatagtagcaatggttatttcgttcatg 240 

IIIIIIIIIIIIIIIIIIIIIIIIIMMIMIIIIIIIIIMIIIIIIIIIIIIIIIII 

Db 3 024 AAGATGAGTGGGTGGAGCGTGATGAGGGGGGTGATAGTAGCAATGGTTATTTCGTTCATG 3083 

Qy 241 atttag 246 
MINI 

Db 3084 ATTTAG 3089 



SEQ ID NO: 
RESULT 6 
PMCMSGI 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



mat_pepticie 



gene 



CDS 



5, nucleotides 2845-3090 



PMCMSGI 3363 bp DNA PLN 26-SEP-1994 

Pneumocystis carinii B-cell receptor (msgl) gene, 3 1 end. 
L27092 

L27092.1 GI:535706 
B-cell receptor. 
Pneumocystis carinii DNA. 
Pneumocystis carinii 

Eukaryota; Fungi; Ascomycota; Pneumocystidomycetes ; 
Pneumocystidaceae; Pneumocystis. 
1 (bases 1 to 3363) 
Garbe,T.R. and Stringer , J. R. 

Molecular characterization of clustered variants of genes encoding 
major surface antigens of human Pneumocystis carinii 
Infect. Immun. 62 (8), 3092-3101 (1994) 
94314421 

Location/Qualifiers 
1. .3363 

/organism^" Pneumocystis carinii 11 
/ db_xr e f = " t axon : 4 7 5 4 " 
152. .3241 
/ gene="msgl" 

/standard_name="major surface antigen" 
/ evidence=experimental 
/product="B-cell receptor" 
152. .3244 
/gene="msgl " 
<152. .3244 
/gene- fl msgl " 
/codon_start=l 
/product="B-cell receptor" 
/protein_id="AAA21645. 1" 
/dbjcref ="GI : 535707" 

/translation="VARAVKRQVAGVKNNEAEERLFALIMRADYKDESKCKNKIKEYC 
DGLKNASLTSEEVHKELKDFCKDGSQGKKCEELKKNVEAKCNNFKTKLEGLVKKDASG 
LTNDDCKENERQCLFLEGACPDLVEDCSKLRNLCYQKKREGVAEEVLLRALRGDLGNK 
TECEKKIKDVCPKIGQESDELTLLCLDQKKTCTNLMTARDKKCNTLEEDVKKALENKN 
NLLGKCLPLLEHATFTEGTAKKASQCTPNKDCEDYLPKCDELAEECGKKGIIYIHPGP 
DFDPTKPEPTVAEDIGLEELYKKAAEDGVHIGKPPVRDATALLALLIQNPDPKIQANE 
KEKCKKVLENKCKELKKHEVLGDLCNQNAASQSGTKKCEELEKELANSTKILSEKIKN 
KHLSGSGETIPWYKLSTFLSDSDCARLESDCFYFAQDKDPLKKECKNVKAACYKRGLD 
ARANKVLQENMRGLLRGSNQSWLKKFQQELVKVCEKLKEENKGSFSNDELFVLCVQPA 
KAARLLTHDLRMTTIFLRQQLDQKRDFPTVKTAKGIREKCQDLGKGFQKEITWPCHTL 
EQQCNRLGTTEILKQVLLNEHKDTLKTHENCVTYLKEKCNKWSRRGDDRFSFVCVFQN 
ATCKLMVKDVQDRCKIFKENIKVSEIVDFLKNNTNNITTLERNCPSWHTYCNRFSSNC 
PDFSKKNPCTKIKNNCKPFYERKALEDALKVELRGKLSDENKCTAALKGYCTLAGNVN 
NASVRSLCKDNTQGSNKKTDEKWEELCKKLMEEVKEQCETLPAELKQPADDLEKDVK 
TYEELKEEAKKAMNKSSLVLSFVKKDGNNTPKNNSKSEDKNWSNEKDTIKHVKILRR 
GVKDVLVTELEAKAFDLAAEVFGRYVDLKERCEKLTLDCGIKDDCDGLKGVCGKIKKK 
CRDLKPLEVKSHEIVTESTTTTTTTTTTVTDPKATECKSLQTTDTWVTQTSTHTSTST 
ITSTITSKITLTSTRRCKPTKCTTGDDAEDVKPSEGLRVSGWNVMRGAIVAMVISFMI 



BASE COUNT 
ORIGIN 



1312 a 505 c 753 g 793 t 



Query Match 87.6%; Score 215.6; DB 8; Length 3363; 

Best Local Similarity 92.3%; Pred. No. 3.4e-49; 

Matches 227; Conservative 0; Mismatches 19; Indels 0; Gaps 



Qy 1 gaatgcaaatccttacagacaacagatacatgggttacacagacatcgacacacacaagc 60 

I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I II I I I I I I I I 

Db 2999 GAATGCAAATCCTTACAGACAACAGACACAT GGGTTACACAGACAT CGACACACACAAGC 3058 

Qy 61 acgtctaccatcacatctaccatcacatcaaaaataacattgacatcaacgaggcgatgc 120 

II I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I II I I I I I I I I I I I III 

Db 3059 ACATCTACCAT CACAT CT ACGATTACAT CAAAAATAACATT GACAT CAACAAGGCGGTGC 3118 

Qy 121 aaaccaaccaagtgtacgacaggggatgaagcaggagacgtgaaaccgagtgagggattg 180 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I II III 
Db 3119 AAAC CAAC CAAGT GT AC GACAGG G GAT GAT GC AGAAGAC GT GAAGC CAAGT GAAGGCT T G 3178 

Qy 181 aagatgagtgggtggagcgtgatgaggggggtgatagtagcaatggttatttcgttcatg 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3179 AGGGTGAGCGGGTGGAATGTGATGAGGGGGGCAATAGTAGCAATGGTTATTTCGTTCATG 3238 

Qy 241 atttag 246 

I I I I I I 
Db 3239 ATTTAG 3244 



Gen Bank database [>Kumcnt Reader 
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LOCUS 

DEFINITION 

ACCESSION 
NID 

KEYWORDS 
SOURCE 

ORGANISM 



PCU3966G 629 bp mRNA P*.. 29-NOV-1995 

Pneumocystis carinii major surface glycoprotein (msg) mRNA, partial 

cds . 

U39660 

gl079706 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



Pneumocystis carinii. 
Pneumocystis carinii 

Eukaryotae; mitochondrial eukaryotes; Fungi; Ascomycota; 
Archaeascomycetes; Pneumocystidaceae; Pneumocystis . 
1 (bases 1 to 629) 

Kovacs,J.A., Edman,J.C. and Angus, C.W. 
Direct Submission 

Submitted ( 30-OCT-1995 ) J. A. Kovacs, CCMD, NIH, Building 10, Rm 
7D43, MSC 1662, Bethesda, MD 20892-1662, USA 
Location/Qualifiers 
1. .629 

/organism^" Pneumocystis carinii" 

/db_xref="taxon: 4754" 
gene 55. . 629 

/gene="msg" 
CDS " 55 . . >629 

/gene="msg" 

/codon_start=l 

/product="ma jor surface glycoprotein" 
/db_xref="PID:gl07 9707" 

/translation="MRIAFFALFAQLSCILVYSIAERDFMSLDEIYGGDISFDHEKLE 
,. r . . . FNEYNQVLQMLEKAKKLGTGFVDRTKDFSNRRYEGRIELNHLGRRPGVDYFRKGGDVF 

TDGYPRGGHLIEDELSEEAAMAR PVKRQAVQGAQDEIDEKHLLAFI VKDKYKEEQKCK 
EELEKYCKELKEADKNLENVDDKVKGLCDDK" 
misc_dif ference 228.. 234 

/gene="msg" 

/note="in some cDNA clones, there are 6 A's at this 

position, instead of 7. This appears to be due to an error 

in reverse transcription" 

/ replace= ,, aaaaaa" 
BASE COUNT 218 a 73 c 154 g 184 t 

ORIGIN 

1 ggaaaatata ttttttcttg atatcegtet cgtcttcagt ttgtttgtgc aataatgagg 
61 attgeatttt ttgcgctttt tgegcaaett agttgtattt tagtttattc aatagcagaa 
121 agggatttca tgtcattaga tgaaatatat ggaggegata taagttttga tcatgaaaaa 
181 ctcgaattta acgaatataa tcaagtttta caaatgettg aaaaggcaaa aaaattggga 
241 aceggctttg ttgatagaac caaagatttt tctaatagac gatatgaagg gagaattgag 
301 ttaaatcatt tggggagacg cccaggagtc gactatttta ggaaaggtgg ggatgttttt 
361 actgatggtt atcctcgtgg aggtcatttg atcgaggatg agttgtccga agaggeggea 
421 atggcacggc eggttaagag gcaagcagta caaggagcac aagatgagat tgatgagaaa 
481 caccttttgg ctttcattgt gaaggacaaa tataaagaag aacaaaaatg caaagaagaa 
541 ctcgagaaat attgtaaaga gttgaaggaa gcagataaaa atctagagaa tgtggatgat 
601 aaagttaaag gactttgtga tgataaaaa 

// 



! ofl 
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ACCESSION 
NID 

KEYWORDS 
SOURCE 



LOCUS 

DEFINITION 



AF04310L 4002 bp mRNA PLN 06-JUN-1998 

Pneumocystis carinii surface glycoprotein A (gpA) mRNA, complete 
cds . 

AF043102 
g3184385 



JOURNAL 
REFERENCE 
AUTHORS 



REFERENCE 
AUTHORS 



ORGANISM 



TITLE 



TITLE 
JOURNAL 




FEATURES 



source 



CDS 



gene* 



Location/Qualifiers 
1. . 4002 

/organism=" Pneumocystis carinii" 
/db_xref="taxon: 47 54" 
/lab_host="C.B-17 SCID mouse" 
1. .4002 
/gene="gpA" 
52. .3849 



/gene="gpA" 
/codon_start=l 

/product= M surf ace glycoprotein A" 
/db_xref="PID:g318 438 6" 

/ translation="MRIAFFALFAQLSYVLGSFLRDRDLPNEEDVYGYENFGLDPNSP 
EPSEFLNIIEMLRNAKRLGSGKINQGQFLSNRKARRDFDLCRSCNRPGVDYFRKSDYD 
GFFSEDFSSEDYSQDKRWVEEVAQKEAAMAQPVKRQAAGQAAGNDEIKEEQVLGLIVK 
SGYNNDNKCKANLKHYCEELKKIDGKLESVDVKVKGLCENGKEGEKCKELKKKLETEL 
GAFKTEVENALNNLTDEKCRKYEEKCLLLEEGDPNNLEEKCVKLRDRCYRQRRQGVAK 
EILLRALEGKVNNKDECKKRMKEICQGLSEYSDELVFSCFNSDKTCEYLQKNHGDSCK 
PLEKELEDKELVEKCQEYLEKCYFYGSSCKEjTKCDKVNNKCKGKGIEYEGPKLDFSPV 
REKPRFPEKIEVENLYKKEEAKGIIVGKPKMKTLRDLALLLIKERNGKDEGEKCKKAL 
EDCESFKHLDYGLEELCGDKDKEDRCKELVEVEDRCTNFKLELYLKGLSTEFEKDKES 
DYFSWGQVSKLVSMEDCIKFESECFHLERVCTNKIGKACENVRVACYKKGQDRVLNRY 
FQEGLKGLIGDLELVTENLEKCQKSWGNYTKLKEDRRYFTKCHLPTKLCYELLDDVI 
LQSEELEWLNLRRDFPRKEDCVELKKKCKDLESDSYLNHEKCDTLNRRCEYLKVTEE 
LRKRLLKRGDDALRTQGNCTAVLKKECEELSKRGKDEFSVSCALREETCSFMVEQTEN 
ECLFLKNNMENGKIINKIEKGNETLVEELCTLFDPYCHQYIENCPDRLKKASNSNhCNG 
VCLELEEKCKPFFEKLKLENELTHELKGSLDKDDECKKALRKHCSEQKNSANQKFNSL 
CNTDKDKDVEEKVCKKLVEKVKRKCPTLENKLNEEKNELKKKKDEYEKAKQESEKFTK 
EAKLVLSR PEQDGQGGGS KAQDGS VPKPVGP PVQ P PAPAQPT PGGVPAPT LAP PAQ PT 
S GGAP L PV P PAAPAA P GAP S T PGT PAAPAG PAAPGT P S T PS T P PAG PAG PS GGT PGAP 
AGPPAPGGSTPSGTTNTSNVILVRRTFVSGEVSEPEKKAFVATARALELYLELKEKCK 
GLKGDCEFRKDCPKCETVCKEIDELCEGIEGLKVTPHHTVTSTATQTTTTTATTTTTT 
TTTTTTTTTTTTTATTTESVDGGKVTE gCTLV^TT DTWVTSTSLHTSTLTSTSTVTST 
VTLTSMRKCKPTRCTSDSSKETETQKEEEKEEEVKPNEGMKIRVPEMIKIMLLGVl^M 

GML" 



1 caaaaatata tttttcttga tatccgtctc ttttctcaaa tttgtgcaat tatgaggatt 

61 gcattttttg cgctttttgc gcaacttagt tatgtcttgg ggagtttttt aagagataga 

121 gatttgecaa atgaggaaga tgtttatggt tatgaaaatt ttgggttgga tcccaattca 

181 ccggaaccga gtgaattttt aaatattata gaaatgette gaaatgcaaa gcgattagga 

241 agtgggaaga ttaaccaggg tcaatttctc tegaategta aggctaggag agattttgat 



BASE COUNT 
ORIGIN 



1616 a 



487 c 



940 g 



959 t 



AH*/QR VSR P* 
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LOCUS 

DEFINITION 

ACCESSION 
NJD 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



gene 
5 ' UTR 



BASE COUNT 
ORIGIN 

1 a 
61 t 

// 



D31920 81 bp mRNA PL. 22-SEP-1997 

Pmtpecystia carinii MSG mRNA for major surface glycoprotein, 

5' UTR. 

D31920 

g2443384 

MSG. 

mUpjOni nf 1 n carinii cDNA to mRNA. 

Pneumocystis carinii 

Eukaryotae; Fungi; Ascomycota; Archaeascomycetes; 
Pneumocystidaceae; Pneumocystis. 

1 (sites) 

Wada,M., Sunkin,S.M., Stringer , J. R . and Nakamura, Y. 
Antigenic variation by positional control of major surface 
glycoprotein gene expression in Pneumocystis carinii 
J. Infect. Dis. 171 (6), 1563-1568 (1995) 
95287050 

2 (bases 1 to 81) 
Nakamura, Y . 
Direct Submission 

Submitted ( 24- JUN-1994 ) to the DDBJ/EMBL/GenBank databases. 
Yoshikazu Nakamura, Institute of Medical Science, University of 
Tokyo, Tumor Biology; Shirokanedai 4-6-1, Minato-ku, Tokyo 108, 
Japan (E-mail : nak@hgc , ims . u-tokyo. ac . jp, Tel : 03-5449-5307, 
Fax:03-5449-5415) 

Location/Qualifiers 

1. . 81 

/organism^" Pneumocystis carinii" 

/db_xref="taxon:4754 M . 

1..81. 

/gene^"MSG" 
1. . 81 

/gene="MSG" 

/note="ma jor surface glycoprotein" 
30 a 10 c 13 g 28 t 

ttcatgaag taactatcaa gcttaataaa ggacccaata taggagtata ttgttaatat 
tatgetata tattgggatc c 
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LOCUS 

DEFINITION 

ACCESSION 
Nit) 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



gene 
5 ' UTR 



D31914 496 bp mRNA PL 22-SEP-1997 

Pneumocystis carinii MSG mRNA for major surface glycoprotein, 

D31914 

g2443378 

MSG. 

Pneumocystis carinii cDNA to mRNA. 
Pneumocystis carinii 

Eukaryotae; Fungi; Ascomycota; Archaeascomycetes; 
Pneumocystidaceae; Pneumocystis. 

1 (sites) 

Wada,M., Sunkin,S.M., Stringer, J. R. and Nakamuxfa, Y . 
Antigenic variation by positional control of major surface 
glycoprotein gene expression in Pneumocystis carinii 
J. Infect. Dis. 171 (6), 1563-1568 (1995) 
95287050 

2 (bases 1 to 496) 
Nakamura, Y . 
Direct Submission 

Submitted (24- JUN-1994 ) to the DDBJ/EMBL/ GenBank databases. 
Yoshikazu Nakamura, Institute of Medical Science, University of 
Tokyo, Tumor Biology; Shirokanedai 4-6-1, Minato-ku, Tokyo 108, 
Japan ( E-mail : nakQhgc . ims . u-tokyo . ac . jp, Tel : 03-5449-5307, 
Fax:03-5449-5415) 

Location/Qualifiers 

1 . . 496 

/organism^" Pneumocystis carinii'* 

/db_xref= n taxon:4754 n . , 

l..*496 

/gene="MSG" 

1 . . 496 

/gene="MSG" 

/note= f, ma j or surface glycoprotein" 
214 a 50 c 89 g 143 t 



BASE COUNT 
ORIGIN 

1 caactatcaa gcttaataaa ggacccaata taggagtata ttgttaatat ttatgctata 

61 tattgggatc cagaaataat atgaaatgct tttttccaaa attaataaaa ttatagttct 

121 aagaatatta attttaattg ttcctactta tggcaatcaa agcacacatt tgaaagccag 

181 agattctcaa gtaagcgcag caaatactcc gtttaaggat attgttgtta aggatgaaga 

241 aatccttgct tatattttga gagaagatta taaagataaa tgcgaactaa agcttgagga 

301 atattgtaaa gaattgaggg atatagatca aggattaaat aaagttcata ctatagttaa 

361 agaaatttgt gatgatgaaa aacgagacgg aaaatgcaaa gaactgaaag acaaigttaa 

421 aaaggaattg gaaactttta aagaggaact tgaaaaagca ttgaaagaca taaaagatga 
481 aaattgtgaa aaatat 

// 



lofl 
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LOCUS AF03522'. 1945 bp mRNA Pi^< 02-JAN-1998 

DEFINITION Pn«U»Qcy«ti« carinii glycoprotein A (gpA) mRNA, partial cds. 
ACCESSION AF035226 
NID" g2739185 
KEYWORDS 

SOURCE Pneumocystis carinii. 

ORGANISM Pneumocystis carinii 

Eukaryotae; Fungi; Ascomycota; Archiascomycetes; Pneumocystidaceae; 
Pneumocystis . 
1 REFERENCE 1 (bases 1 to 1945) 

AUTHORS Guadiz,G., Haidaris , C . G. , Maine, G.N. and Simpson-Haidaris , P . J . 
TITLE Direct Submission 

JOURNAL Submitted { 19-NOV-1997 ) Medicine-Vascular Medicine, University of: 
Rochester, 601 Elmwood Avenue, Box 610, Rochester, NY 14642, USA 
FEATURES Location/Qualifiers 
source 1 . . 1945 

/ organism="Pneumocystis carinii" 
/db_xref="taxon: 4754" 
/tissue_type="inf ected lung" 
/clone="2-lA" 
gene <1..1945 

/gene="gpA" 
CDS <1. . 1914 

/gene="gpA" 
/codon^start^l 
/product="glycoprotein A" 
/db_xref="PID:g27 3 918 6" 
..... , , . . /translation="KKELRKKNFCNDDSCSNEKDNWEKLVKLRDEDCAKLQSKCFYLE 

KHCSANLKKPCNNVRVTCYKRGLKAAAMTLLESVLKGKLKPGPNNDYKDCQKALLEKC 
KEVRNISAEVFEMCLYPENTCQNLSIDVQQKANYLSLFSLQNRDHPSQEDCVELQEKC 
EALEADADWLRPPCETLRTHCNFLYLSEKLKHHLLSEGKGKLSSNEICNKELNERCHS 
WHKKKNETYAFPCALRNESCELMVWRVGKHCDEFKENLKNYNTTLIKNPSEETCLLWA 
PHCENLTPNCEEKVAKDINETCRKLQEKCSPVLEKRNLKNKLKRELKGKLTDGKKTEC 
I DY FKS LCTQADHPNKTALDI LCKENGQNIDDDKAKEKKCQ PLI DE VQWECPLLKTKL 
EKAKDEVAKKADEYKKLKEEAEKAITETKLNVTELPKEAGKATLIRRDPKGSSPAKJIS 
VKVEWSVPITKEHAEAIDLLSQALDLYFELREECDALI LDCDFKEDCPQCKDSCNSI 
LATCKGLQPLTIAEPRIIEDQKPEAGKEDTPEGGQAGGQAAGKECTSITTSDTWVTST 
STHTTTTTSTKTTTVKVTLTSTRKCKPTKCTTGKGEPTSRGDEEEEWPSGGRKWLPS 
LSIVIVGVWALV" 
BASE COUNT 744 a 279 c 451 g 471 t 

ORIGIN 

1 aaaaaagaac ttcgtaaaaa gaatttctgc aatgacgata getgeagtaa tgaaaaagat 
61 aattgggaga aattagtaaa gttaagagat gaagattgtg caaaattaca atctaagtgt 
121 ttttatttag aaaaacattg ttcagctaat cttaaaaaac catgtaacaa tgttagagta 
181 acctgttaca agagaggatt aaaggcagcg gctatgactt tgttagagag tgtgttgaaa 
241 gggaaactta aaccaggtcc aaataatgat tataaagatt gecaaaaage attattagaa 
301 aagtgtaaag aagtgagaaa tatcagtgea gaagtgtttg agatgtgttt atatccagaa 
361 aatacgtgtc aaaatctttc tattgatgtt caacaaaaag caaattattt aagccttttt 
421 tcacttcaaa acagggacca tccaagtcaa gaagactgtg ttgaactaca agagaaatgt 
481 gaagctttag aagcagatgc agactggctt cgacctcctt gtgaaacatt gagaaegcat 
541 tgeaacttte tttacctttc agaaaagttg aaacatcatt tgttgagtga agggaaaggt 
601 aaattaagta gcaatgagat ttgcaacaaa gagttgaatg agaggtgtca ctcatggcat 
661 aaaaaaaaga aegaaaegta tgcttttccg tgtgctttgc gcaatgaatc ttgtgagttg 
721 atggtttggc gtgtaggaaa acattgtgat gaatttaaag aaaacttgaa aaattataat 
781 acaacattaa ttaaaaatcc atcagaagaa acatgtcttt tgtgggcacc acactgtgaa 
841 aatctgaege caaactgtga agaaaaggtg gctaaagata tcaatgagac ttgccgtaaa 
901 ctccaggaga aatgttcacc agttcttgag aagaggaatt tgaaaaataa attgaaacgt 
961 gagttgaaag ggaaattaac ggatggaaag aagacagaat gtatcgatta ttttaaaagt 
1021 ctttgtacac aggcagatca tccaaataaa aeggcacttg acatattatg taaagaaaat 
1081 ggacaaaata ttgatgatga taaagctaaa gaaaagaaat gccaaccact cattgatgaa 
1141 gtacaatggg agtgtccact attaaaaaca aagttagaaa aagcaaagga tgaagtggca 
1201 aagaaggcag atgaatataa aaaactaaaa gaagaggcag aaaaggctat tacggaaaca 
1261 aaactcaatg ttacagaact accaaaggaa gcaggaaagg caaccctgat tagaagagat 
1321 ccaaaaggca gcagtccagc caagagatca gtaaaggtag aggttgtgtc tgttccaata 
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1381 acgaaagaac at *g<?c aattgattta 

1441 ttgagagagg agt s .gatgc gttaatatta 

1501 tgtaaagatt catgtaatag tatacttget 

1561 gcggaaccac gtattattga agatcagaaa 

1621 ggaggacaag ctggaggaca agetgeggga 

1681 acatgggtca egagtaegtc tacgeatacg 

1741 gttaaagtga cgttaacgtc aacgaggaag 

1801 ggagagcega caagtagagg tgatgaagag 

1861 tggttgccaa gtctaagtat agttattgtt 

1921 atgaataaga cgttccatgt gaaat 



ttatctcaag cat 1 ;tct ttatttcgag 
gattgtgact tta^ ,agga ttgtcctcaa 
acctgtaaag ggttacaacc tcttacaatc 
ccagaagctg gaaaggaaga tactcctgaa 
aaggagtgca cctctatcac aacaagtgac 
acgacgacca cctcgacaaa gacgacgaca 
tgtaagcega ccaagtgtac cactggaaaa 
gaggaggtgg tgccgagtgg aggaaggaag 
ggtgtggtgg tggcactcgt gtagatcatg 
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LOCUS PCU5392j 2470 bp DNA Pu*. 24-DEC-1996 

DEFINITION Pneumocystis carinii major surface glycoprotein (MSG) gene, partial 
cds . 

ACCESSION U53921 
NlD gl750217 
KEYWORDS 

SOURCE Pneumocystis carinii. 

ORGANISM Pneumocystis carinii 

Eukaryotae; mitochondrial eukaryotes; Fungi; Ascomycota; 
Archaeascomycetes; Pneumocystidaceae; Pneumocystis. 
REFERENCE 1 (bases 1 to 2470) 

AUTHORS Edman,J.C, Hatton,T.W., Nam,M., Turner, R., Mei,Q., Angus, C.W. and 
Kovacs, J. A. 

TITLE A single expression site with a conserved leader sequence regulates 

variation of expression of the Pneumocystis carinii family of major 
surface glycoprotein genes 

JOURNAL DNA Cell Biol. 15 (11), 989-999 (1996) 

MEDLINE 97101113 
REFERENCE 2 (bases 1 to 2470) 

AUTHORS Kovacs, J. A. 

TITLE Direct Submission 

JOURNAL Submitted { 09-APR- 1996 ) Joseph A. Kovacs, CCMD/CC, NIH, Bldg . 10, 
RM. 7D43, MSC1662, Bethesda, MD 20892-1662, USA 
FEATURES Location/Qualifiers 
source 1. .2470 

/organism^" Pneumocystis carinii" 
/db_xref="taxon: 4754" 
... /lab_host="rat ,t -. •■<....> 
gene 1940 . .2470 

/gene="MSG" 

CDS join(<1940. .2036, 2187. .>2470) 

/gene="MSG" 

/note="gpA; expression site of MSG; encodes leader 
sequence of MSG; surface protein" 
/codon_start=l 

/product="ma j or surface glycoprotein" 
/db_xref="PID:gl750218" 

/ translation="MRIAFFALFAQLSCILVYSIAERDFMSLDEIYEGGDISFDHEKL 
EFNEYNQVLQMLEKAKKLGTGFVDRT.KDFSNRRYEGRIELNHLGRRPGVDYFRKGGDV 
FTDGYPRGGHLIEDELSEEAAMARP" 
BASE COUNT 707 a 335 c 411 g 1017 t 

ORIGIN \ 

1 gaattccata ataattgtat tgtaaattat gaatataatt ttgattattt 
61 tagtattttt ttataataat attttctatt atcaacttga tgttgaaaat 
121 atcttgatgt ttagagtaaa tgttgatatt ttcaagtcgt attaggtatc 
181 attttttaat tgaaaaggta taaagtatga ataatatata atatggggaa 
241 tagtatataa atcatagcaa catgattttt ttttgtattt aaatcagatt 
301 ttttttatat atttataaat cttgttatag tatatatttt caagtgtttt 
361 agttttaaag agagtgttaa ttaaggtaga aatgttttat taatggtgat 
421 atattgtttt ttgtctaata aaatgaatat tttctaaaat aaagattatt 
481 attgattaaa atgtagaatg tatgattatt tcaagtgatt gtatagtact 
541 cgtcctcgtc gtcctcgtcg tcctcgtcgt ggtagtgatg agagttttat 
601 attttaaaaa aaataaagat tatttatata tatatatatt tgaaaaaatt 
661 ttatttagta tgatttttaa attttacaag agttgaaagt tttgttgtat 
721 ttattaaggt ttagttggaa cgaagttgga agtttaatat atgttgaagt 
781 gggttaggtc gaacgtttag gtttaagaag aatatatagg gttaegtega 
841 tgaagtagaa tatatagggt taggtctaat gtttaggtta ggtcaaagtg 
901 tctaacattc ggggttaggt caaagtatag gttaggtcga cgtgtaggtt 
961 tatgtatagg gttagggtta ggttcactct tttttgaatg aaaaaacttt 
1021 aaaagtaata aaactcaaat ctatagatta acatggatcc gtttttatta 
1081 tgttttatgt ataaaacttt ctttttttta tttttacttg gtccctttct 
1141 atacttggaa ttttctgata tattacaata gatggttcat tcctctactc 
1201 tcttatttaa ctgaccattc atgctatttt tcccctattt tttataaaaa 
1261 ttatatatag aatggcgtat aegtaaaaat agaaacgtgt atgcgtgaaa 



ctttaaaata 
ategtaattt 
cgttggaatt 
ttttattttg 
ttataaaatg 
caatgggttt 
atatatatat 
tttttaaaaa 
agtccctcgt 
tgaatcatgt 
gaaaggagaa 
ttttatttca 
agaatatata 
aggtttaggt 
cagggttagg 
aggtctgagt 
tttttaataa 
taaaatttca 
ttccctcatc 
acttttaaaa 
gactcgtcgc 
tagaatggtg 
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1321 cttttaaatt tat :ttt attcaaatta 

1381 atccattatt ctt. .caca aagtatttct 

1441 tatttggaaa ectatgeata tactcgatat 

1501 tttcctttca ccccctctgg gacactcccc 

1561 tgtccctccc tcgacccttc tatacccatc 

1621 ttttactcat ttctatccta taaaaattga 

1681 gcatttttag gactggctaa tattttaagg 

1741 cccccatccg gaaattggct tttccttttt 

1801 cegtcaatat ttaagggtct ttgeatatta 

1861 atacccgttg cttcctttag cgcttggaaa 

1921 tcagtttgtt tgtgcaataa tgaggattgc 

1981 tattttagtt tattcaatag cagaaaggga 

2041 gttttttttc cttaatttcc tgttgtatca 

2101 tactcatctt tttactcatc tttctattct 

2161 tttatggttt tactcatgtt ttctagaagg 

2221' cgaatttaac gaatataatc aagttttaca 

2281 cggctttgtt gatagaacca aagatttttc 

2341 aaatcatttg gggagacgee caggagtcga 

2401 tgatggttat cctcgtggag gtcatttgat 
2461 ggcacggccg 



ggaaatcttt ttt *ttt tagttttgga 
ttttttccta aca -tat ttatacttta 
cactctgact ttgeatatte tactccctcc 
tgtceggaca caccccggac agctcttggc 
tatccatcta tccatctata tatacaccgt 
cttgtaattt tggatatatt ccttctatat 
tctagctctt tttatttccg gctagtcatc 
cagegtttte ttcgattcat ccgtaaattt 
aggggactgt ttctggactg tatttttacc 
atatattttt tcttgatatc cgtctcgtct 
attttttgeg etttttgege aacttagttg 
tttcatgtca ttagatgaaa tatatggtga 
ttttttacac gtcttttttt actegtcttt 
ctcctcttct tttggaaatc tctttttcct 
aggegatata agttttgatc atgaaaaact 
aatgcttgaa aaggcaaaaa aattgggaac 
taatagacga tatgaaggga gaattgagtt 
ctattttagg aaaggtgggg atgtttttac 
cgaggatgag ttgtccgaag aggeggcaat 
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LOCUS 

DEFINITION 

ACCESSION 

NID 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



AF03522* 1945 bp mRNA P-« 02-JAN-1998 

Pneumocystis carinii glycoprotein A (gpA) mRNA, partial cds. 
AF035226 
g2739185 

Pneumocystis carinii. 
Pneumocystis carinii 

Eukaryotae; Fungi; Ascomycota; Archiascomycetes; Pneumocystidaceae; 
Pneumocystis. 

1 (bases 1 to 1945) , 
Guadiz,G., Haidaris,C.G., Maine, G.N. and Simpson-Haidaris, P.J. 

Direct Submission . . - 

Submitted (19-NOV-1997) Medicine-Vascular Medicine, University of 
Rochester, 601 Elmwood Avenue, Box 610, Rochester, NY 14642, USA 
Location/Qualifiers 
1. .1945 

/organism=" Pneumocystis carinii" 
/db_xref="taxon: 4754" 
/tissue_type="infected lung" 
/clone="2-lA" 
<1..1945 
/gene="gpA" 
<1. .1914 
/gene="gpA" 
/codon_start=l 
/product="glycoprotein A" 
/db xref="PID:g2739186" 

/translation="KKELRKKNFCNDDSCSNEKDNWEKLVKLRDEDCAKLQSKCFYLE 
KHCSANLKKPCNNVRVTCYKRGLKAAAMTLLESVLKGKLKPGPNNDYKDCQKALLEKC 
KEVRNISAEVFEMCLYPENTCQNLSIDVQQKANYLSLFSLQNRDHPSQEDCVELQEKC 
EALEADADWLRPPCETLRTHCNFLYLSEKLKHHLLSEGKGKLSSNEICNKELNERCHS 

WHKKKNETYAFPCALRNESCELMVWRVGKHCDE FKENLKNYNTTLIKNPSEETCLLWA 
PHCENLTPNCEEKVAKDINETCRKLQEKCSPVLEKRNLKNKLKRELKGKLTDGKKTEC 

I DY FKS LCTQADHPNKTALDI LCKENGQNI DDDKAKEKKCQ PL I DE VQWEC PLLKTKL 
EKAKDEVAKKADEYKKLKEEAEKAITETKLNVTELPKEAGKATLIRRDPKGSSPAKRS 
VKVEVVSVPITKEHAEAIDLLSQALDLYFELREECDALILDCDFKEDCPQCKDSCNSI 
LATCKGLQPLTIAEPRIIEDQKPEAGKEDTPEGGQAGGQAAGKECTSITTSDTWVTST 
STHTTTTTSTKTTTVKVTLTSTRKCKPTKCTTGKGEPTSRGDEEEEWPSGGRKWLPS 



BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 



LSIVIVGWVALV" 
744 a 279 c 451 g 



aaaaaagaac 
aattgggaga 
ttttatttag 
acctgttaca 
gggaaactta 
aagtgtaaag 
aatacgtgtc 
tcacttcaaa 
gaagctttag 
tgcaactttc 
aaattaagta 
aaaaaaaaga 
atggtttggc 
acaacattaa 
aatctgacgc 
ctccaggaga 
gagttgaaag 
ctttgtacac 
ggacaaaata 
gtacaatggg 
aagaaggcag 
aaactcaatg 
ccaaaaggca 



ttcgtaaaaa 
aattagtaaa 
aaaaacattg 
agagaggatt 
aaccaggtcc 
aagtgagaaa 
aaaatctttc 
acagggacca 
aagcagatgc 
tttacctttc 
gcaatgagat 
acgaaacgta 
gtgtaggaaa 
ttaaaaatcc 
caaactgtga 
aatgttcacc 
ggaaattaac 
aggcagatca 
ttgatgatga 
agtgtccact 
atgaatataa 
ttacagaact 
gcagtccagc 



gaatttctgc 
gttaagagat 
ttcagctaat 
aaaggcagcg 
aaataatgat 
tatcagtgca 
tattgatgtt 
tccaagtcaa 
agactggctt 
agaaaagttg 
ttgcaacaaa 
tgcttttccg 
acattgtgat 
atcagaagaa 
agaaaaggtg 
agttcttgag 
ggatggaaag 
tccaaataaa 
taaagctaaa 
attaaaaaca 
aaaactaaaa 
accaaaggaa 
caagagatca 



471 t 

aatgacgata 
gaagattgtg 
cttaaaaaac 
gctatgactt 
tataaagatt 
gaagtgtttg 
caacaaaaag 
gaagactgtg 
cgacctcctt 
aaacatcatt 
gagttgaatg 
tgtgctttgc 
gaatttaaag 
acatgtcttt 
gctaaagata 
aagaggaatt 
aagacagaat 
acggcacttg 
gaaaagaaat 
aagttagaaa 
gaagaggcag 
gcaggaaagg 
gtaaaggtag 



gctgcagtaa 
caaaattaca 
catgtaacaa 
tgttagagag 
gccaaaaagc 
agatgtgttt 
caaattattt 
ttgaactaca 
gtgaaacatt 
tgttgagtga 
agaggtgtca 
gcaatgaatc 
aaaacttgaa 
tgtgggcacc 
tcaatgagac 
tgaaaaataa 
gtatcgatta 
acatattatg 
gccaaccact 
aagcaaagga 
aaaaggctat 
caaccctgat 
aggttgtgtc 



tgaaaaagat 
atctaagtgt 
tgttagagta 
tgtgttgaaa 
attattagaa 
atatccagaa 
aagccttttt 
agagaaatgt 
gagaacgcat 
agggaaaggt 
ctcatggcat 
ttgtgagttg 
aaattataat 
acactgtgaa 
ttgccgtaaa 
attgaaacgt 
ttttaaaagt 
taaagaaaat 
cattgatgaa 
tgaagtggca 
tacggaaaca 
tagaagagat 
tgttccaata 
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1381 acgaaagaac at-, aggc aattgattta 
1441 ttgagagagg agt^ -gatgc gttaatatta 
1501 tgtaaagatt catgtaatag tatacttgct 
1561 gcggaaccac gtattattga agatcagaaa 
1621 ggaggacaag ctggaggaca agctgcggga 
1681 acatgggtca cgagtacgtc tacgcatacg 
1741 gttaaagtga cgttaacgtc aacgaggaag 
1801 ggagagccga caagtagagg tgatgaagag 
1861 tggttgccaa gtctaagtat agttattgtt 
1921 atgaataaga cgttccatgt gaaat 



ttatctcaag cat \tct ttatttcgag 
gattgtgact tta- .agga ttgtcctcaa 
acctgtaaag ggttacaacc tcttacaatc 
ccagaagctg gaaaggaaga tactcctgaa 
aaggagtgca cctctatcac aacaagtgac 
acgacgacca cctcgacaaa gacgacgaca 
tgtaagccga ccaagtgtac cactggaaaa 
gaggaggtgg tgccgagtgg aggaaggaag 
ggtgtggtgg tggcactcgt gtagatcatg 
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LOCUS 

DEFINITION 

ACCESSION 
NID 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



cds . 

U53921 

gl750217 

Pneumocystis carinii . 

EEESS\££££*i-l eu*aryotes ; Fu„ gi; Asc-ycota, 
Mch.;a.comyct«.l meumocystidaceaa! Pneumocystis. 

11*™%.*.. «-.«.. Turner^., 1,0.. Anous.c.w. «d 

Kovacs,J.A. conserved leader sequence regulates 

^:„ e or:^:"sioro?iL%„:^:ystis ..^i « 1? .« -i« 

97101113 

2 (bases 1 to 2470) 
Kovacs, J. A. 

SS^tSToSiS-m*) -sepn A- Kovacs CCJ./CC, MIH. Bldg. 10, 

RM. 7D43, MSC1662, Bethesda, MD 20892-1662, USA 

Location/Qualifiers 

1..2470 

/organism="Pneumocystis carinii 

/db_xref="taxon: 4754" 

/lab_host="rat* ' 

1940. .2470 

/gene=**MSG" 

join(<1940. .2036,2187. .>2470) 

/no?e=»gS;' expression site of MSG; encodes leader 
sequence of MSG; surface protein 
/codon start=l 

/product="major surface glycoprotein 

/cab_ Xr ef=»PID:gl^50218" RDFMSLDEIYEGGDIS FDHEKL 

FTDGYPRGGHLIEDELSEEAAMARP" 
335 c 411 g 1017 t 



BASE COUNT 707 a 

ORIGIN ^ 4. „ f v a f aaatataatt ttqattattt ctttaaaata 

1 gaattccata ataattgtat tgtaaattat ^atataatt J ategtaattt 
61 tagtattttt "ataataat attttctatt atcaact g « * cgttgga att 
121 atcttgatgt ttagagtaaa tgttgatatt atatggggaa ttttattttg 

181 attttttaat tgaaaaggta taaagtatga J^aatatata ^ «ggg ttataaaatg 
241 tagtatataa atcatagcaa catgattttt ttttgtattt caatgggttt 
301 ttttttatat atttataaat cttgttatag tatatatttt «ag g tatat 
361 agttttaaag agagtgttaa ttaaggtaga "tgttttat taatgg g tttttaaaaa 
421 atattgtttt ttgtctaata -atgaatat tttctaaaat aaag agtccctcgt 
481 attgattaaa atgtagaatg tatgattatt ^caagcg « * tgaatcatgt 
541 cgtcctcgtc gtcctcgtcg tcctcgtcgt J^agtgatg ag J gaa aggagaa 
601 attttaaaaa aaataaagat tatttatata tatataca J ttttatttca 
661 ttatttagta tgatttttaa attttacaag agttgaaagt «9 atatata 
721 ttattaaggt ttagttgg.a c ? aagttgga agtttaatat -^J J aggtttaggt 
781 gggttaggtc gaacgtttag Jtttaagaag aat gg « caaa cagggt tagg 
841 tgaagtagaa tatatagggt ta ggtctaat g« cqtgtaggtt aggtctgagt 

901 tctaacattc ggggttaggt ^aagtatag gtaggtcga cgtg gg 
9 61 tatgtatagg gttagggtta ^"-ctct tttttgaatg a a taaaatttca 

1021 aaaagtaata aaactcaaat ^tatagatta £atgg gtccctttct ttccctcatc 

\n\ sssss ;s ssns : f« < : ™ j-ee 

US ESSE uses 
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1321 cttttaaatt tac *Jttt attcaaatta 
1381 atccattatt ctt N ..caca aagtatttct 
1441 tatttggaaa cctatgcata tactcgatat 
1501 tttcctttca ccccctctgg gacactcccc 
1561 tgtccctccc tcgacccttc tatacccatc 
1621 ttttactcat ttctatccta taaaaattga. 
1681 gcatttttag gactggctaa tattttaagg 
1741 cccccatccg gaaattggct tttccttttt 
1801 ccgtcaatat ttaagggtct ttgcatatta 
1861 atacccgttg cttcctttag cgcttggaaa 
1921 tcagtttgtt tgtgcaataa tgaggattgc 
1981 tattttagtt tattcaatag cagaaaggga 
2041 gttttttttc cttaatttcc tgttgtatca 
2101 tactcatctt tttactcatc tttctattct 
2161 tttatggttt tactcatgtt ttctagaagg 
2221 cgaatttaac gaatataatc aagttttaca 
2281 cggctttgtt gatagaacca aagatttttc 
2341 aaatcatttg gggagacgcc caggagtcga 
2401 tgatggttat cctcgtggag gtcatttgat 
2461 ggcacggccg 



ggaaatcttt ttt ".ttt tagttttgga 
ttttttccta aca ^tat' ttatacttta 
cactctgact ttgcatattc tactccctcc 
tgtccggaca caccccggac agctcttggc 
tatccatcta tccatctata tatacaccgt 
cttgtaattt tggatatatt ccttctatat 
tctagctctt tttatttccg gctagtcatc 
cagcgttttc ttcgattcat ccgtaaattt 
aggggactgt ttctggactg tatttttacc 
atatattttt tcttgatatc cgtctcgtct 
attttttgcg ctttttgcgc aacttagttg 
tttcatgtca ttagatgaaa tatatggtga 
ttttttacac gtcttttttt actcgtcttt 
ctcctcttct tttggaaatc tctttttcct 
aggcgatata agttttgatc atgaaaaact 
aatgcttgaa aaggcaaaaa aattgggaac 
taatagacga tatgaaggga gaattgagtt 
ctattttagg aaaggtgggg atgtttttac 
cgaggatgag ttgtccgaag aggcggcaat 
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DEFINITION 

ACCESSION 
NID 

KEYWORDS 
SOURCE 

ORGANISM 



PCU3966>i3|^ 629 b P ntfWA 

Pneumocystis carinii major surface glycoprotein 
cds . 
U39660 
g!079706 



29-NOV-1995 
(msg) mRNA, partial 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 



Fungi ; Ascomyco t a ; 
Pneumocystis . 



BASE COUNT 
ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
// 



Pneumocystis carinii . 
Pneumocystis carinii 

Eukaryotae; mitochondrial eukaryotes; 
Archaeascomycetes; Pneumocystidaceae; 
1 (bases 1 to 629) 

Kovacs,J.A., Edman, J.C. and Angus, C.W. 
Direct Submission 

Submitted ( 30-OCT-1995 ) J. A. Kovacs, CCMD, NIH, Building 10, Rm 
7D43, MSC 1662, Bethesda, MD 20892-1662, USA 
Location/Qualifiers 
1. .629 

/organisms" Pneumocystis carinii" 
/db_xref="taxon:4754" 
55. . 629 
/gene="msg" 
55. .>629 
/gene="msg" 
/codon_start=l 

/product="ma jor surface glycoprotein" 
/db_xref="PID:gl079707" 

/ translation="MRIAFFALFAQLSCILVYSIAERDFMSLDEIYGGDISFDHEKLE 
FNEYNQVLQMLEKAKKLGTGFVDRTKDFSNRRYEGRIELNHLGRRPGVDYFRKGGDVF 
TDGY PRGGHLI EDELSEEAAMARPVKRQAVQGAQDE I DEKHLLAFI VKDKYKEEQKCK 
EELEKYCKELKEADKNLENVDDKVKGLCDDK" 
misc_dif f erence 228.. 234 

/gene="msg" 

/note="in some cDNA clones, there are 6 A's at this 
position, instead of 7. This appears to be due to an error 
in reverse transcription" 
/replace="aaaaaa" 
218 a 73 c 154 g 184 t 



CDS 



ggaaaatata ttttttcttg atatccgtct cgtcttcagt ttgtttgtgc aataatgagg 
attgcatttt ttgcgctttt tgcgcaactt agttgtattt tagtttattc aatagcagaa 
agggatttca tgtcattaga tgaaatatat ggaggcgata taagttttga tcatgaaaaa 
ctcgaattta acgaatataa tcaagtttta caaatgcttg aaaaggcaaa aaaattggga 
accggctttg ttgatagaac caaagatttt tctaatagac gatatgaagg gagaattgag 
ttaaatcatt tggggagacg cccaggagtc gactatttta ggaaaggtgg ggatgttttt 
actgatggtt atcctcgtgg aggtcatttg atcgaggatg agttgtccga agaggcggca 
atggcacggc cggttaagag gcaagcagta caaggagcac aagatgagat tgatgagaaa 
caccttttgg ctttcattgt gaaggacaaa tataaagaag aacaaaaatg caaagaagaa 
ctcgagaaat attgtaaaga gttgaaggaa gcagataaaa atctagagaa tgtggatgat 
aaagttaaag gactttgtga tgataaaaa 
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ACCESSION 
NID 

KEYWORDS 
SOURCE 

ORGANISM 



06-JUN-1998 
mRNA, complete 



Pneumocystidaceae; 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

FEATURES 

source 



gene 



Gigliotti,F. and 



Ctr. ( 



CDS 



AF04310^^ 4002 bp mRNA 

Pneumocystis carinii surface glycoprotein A (gpA) 
cds. 

AF043102 
g3184385 

Pneumocystis carinii. 
Pneumocystis carinii 

Eukaryota; Fungi; Ascomycota; Archiascomycetes; 
Pneumocystis . 

1 (bases 1 to 4002) 

Haidaris, C.G. , Medzihradsky, O. F. , Gigliotti,F. and 
Simpson-Haidaris, P.J. — ^ 

Molecular characterization dz mouse^/Pneumocystis carinii surface 

glycoprotein A ^ 

DNA Res. 5, 77-85 (1998) 

2 (bases 1 to 4002) 
Haidaris, C.G. , Medzihradsky, O. 
Simpson-Haidaris, P.J. 
Direct Submission 

Submitted ( 14- JAN-1998 ) Microbiol/Immunol, U. Rochester Med. 
Box 672, 601 Elmwood Ave., Rochester, NY 14642, USA 
Location/ Qualifiers 
1. .4002 

/organism=" Pneumocystis carinii" 
/db_xref="taxon: 4754" 
/lab_host="C.B-17 SCID mouse" 
1. .4002 
/gene=" gpA" 
52. .3849 
/gene="gpA" 
/codon_start=l 

/product="surf ace glycoprotein A" 
/db_xref="PID:g3184386" 

/ translation="MRIAFFALFAQLSYVLGSFLRDRDLPNEEDVYGYENFGLDPNSP 
EPSEFLNIIEMLRNAKRLGSGKINQGQFLSNRKARRDFDLCRSCNRPGVDYFRKSDYD 
GFFSEDFSSEDYSQDKRWVEEVAQKEAAMAQPVKRQAAGQAAGNDEIKEEQVLGLIVK 
SGYNNDNKCKANLKHYCEELKKIDGKLESVDVKVKGLCENGKEGEKCKELKKKLETEL 
GAFKTEVENALNNLTDEKCRKYEEKCLLLEEGDPNNLEEKCVKLRDRCYRQRRQGVAK 
EILLRALEGKVNNKDECKKRMKEICQGLSEYSDELVFSCFNSDKTCEYLQKNHGDSCK 
PLEKELEDKELVEKCQEYLEKCYFYGSSCKDTKCDKVNNKCKGKGIEYEGPKLDFSPV 
REKPRFPEKIEVENLYKKEEAKGIIVGKPKYKTLRDLALLLIKERNGKDEGEKCKKAL 
EDCESFKHLDYGLEELCGDKDKEDRCKELVEVEDRCTNFKLELYLKGLSTEFEKDKES 
DYFSWGQVSKLVSMEDCIKFESECFHLERVCTNKIGKACENVRVACYKKGQDRVLNRY 
FQEGLKGLIGDLELVTENLEKCQKSWGNYTKLKEDRRYFTKCHLPTKLCYELLDDVI 
LQSEELEWLNLRRDFPRKEDCVELKKKCKDLESDSYLNHEKCDTLNRRCEYLKVTEE 
LRKRLLKRGDDALRTQGNCTAVLKKECEELSKRGKDEFSVSCALREETCSFMVEQTEN 
ECLFLKNNMENGKIINKIEKGNETLVEELCTLFDPYCHQYIENCPDRLKKASNSNKNG 
VCLELEEKCKPFFEKLKLENELTHELKGSLDKDDECKKALRKHCSEQKNSANQKFNSL 
CNTDKDKDVEEKVCKKLVEKVKRKCPTLENKLNEEKNELKKKKDEYEKAKQESEKFTK 
EAKLVLSRPEQDGQGGGSKAQDGSVPKPVGPPVQPPAPAQPTPGGVPAPTLAPPAQPT 
S GGAPL PV P PAAPAAPGAPS T PGT PAAPAG PAAPGT PST PS T P PAG PAG P S GGT PGAP 
AGPPAPGGSTPSGTTNTSNVILVRRTFVSGEVSEPEKKAFVATARALELYLELKEKCK 
GLKGDCEFRKDCPKCETVCKEIDELCEGIEGLKVTPHHTVTSTATQTTTTTATTTTTT 
TTTTTTTTTTTTTATTTESVDGGKVTE lgCTLV^ TTDTWVTSTSLHTSTLTSTSTVTST 
VTLTSMRKCKPTRCTSDSSKETETQKEEEKEEEVKPNEGMKIRVPEMIKIMLLGVIV]^ 
GML" 

BASE COUNT 1616 a 487 c 940 g 959 t 

ORIGIN 

1 caaaaatata tttttcttga tatccgtctc ttttctcaaa tttgtgcaat tatgaggatt 
61 gcattttttg cgctttttgc gcaacttagt tatgtcttgg ggagtttttt aagagataga 
121 gatttgccaa atgaggaaga tgtttatggt tatgaaaatt ttgggttgga tcccaattca 
181 ccggaaccga gtgaattttt aaatattata gaaatgcttc gaaatgcaaa gcgattagga 
241 agtgggaaga ttaaccaggg tcaatttctc tcgaatcgta aggctaggag agattttgat 
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KEYWORDS 
SOURCE 

ORGANISM 



ysna 



81 bp mRNA PflBP 22-SEP-1997 

s carinii MSG mRNA for major surface glycoprotein, 



D31920 
Pn*ipf»cys 
5'UTR. 
D31920 
g2443384 
MSG. 

TOfp|OCystis carinii cDNA to mRNA. 

Pneumocystis carinii 

Eukaryotae; Fungi; Ascomycota; Archaeascomycetes; 
Pneumocystidaceae; Pneumocystis. 

1 (sites) 

Wada,M., Sunkin,S.M., Stringer, J. R. and NakamuraJ. 
Antigenic variation by positional control of major surface 
glycoprotein gene expression in Pneumocystis carinii 
J. Infect. Dis. 171 (6), 1563-1568 (1995) 
95287050 

2 (bases 1 to 81) 
Nakamura, Y. 
Direct Submission 

Submitted (24-JUN-1994 ) to the DDBJ/EMBL/ GenBank databases. 
Yoshikazu Nakamura, Institute of Medical Science, University of 
Tokyo, Tumor Biology; Shirokanedai 4-6-1, Minato-ku, Tokyo 108, 
Japan (E-mail : nak@hgc . ims . u-tokyo .ac.jp, Tel : 03-5449-5307 , 
Fax:03-5449-5415) 

Location/Qualifiers 
1. .81 

/organism 32 " Pneumocystis carinii" 

/db_xref="taxon:4754" 
gene 1 . . 81 

/gene= M MSG" 
5'UTR 1..81 

/gene="MSG" 

/note= M major surface glycoprotein" 
BASE COUNT 30 a 10 c 13 g 28 t 

ORIGIN 

1 attcatgaag taactatcaa gcttaataaa ggacccaata taggagtata ttgttaatat 
61 ttatgctata tattgggatc c 

// 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 
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LOCUS 

DEFINITION 

ACCESSION 
NID 

KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



gene 



5'UTR 



BASE COUNT 
ORIGIN 

1 



:ysTis 



D31914 496 bp mRNA 22-SEP-1997 

Pneumocystis carinii MSG mRNA for major surface glycoprotein, 
5 ',113ft. « 
D3T914 
g2443378 
MSG. 

Pneumocystis carinii cDNA to mRNA. 
Pneumocystis carinii 

Eukaryotae; Fungi; Ascomycota; Archaeascomycetes; 
Pneumocystidaceae; Pneumocystis. 

1 (sites) 

Wada,M., Sunkin,S.M., Stringer, J. R. and Nakamura, Y . 
Antigenic variation by positional control of major surface 
glycoprotein gene expression in Pneumocystis carinii 
J. Infect. Dis. 171 (6), 1563-1568 (1995) 
95287050 

2 (bases 1 to 496) 
Nakamura, Y. 
Direct Submission 

Submitted (24- JUN-1994 ) to the DDB J /EMBL/ GenBank databases. 
Yoshikazu Nakamura, Institute of Medical Science, University of 
Tokyo, Tumor Biology; Shirokanedai 4-6-1, Minato-ku, Tokyo 108, 
Japan (E-mail : nakdhgc . ims . u-tokyo . ac . jp, Tel : 03-5449-5307, 
Fax:03-5449-5415) 

Location/Qualifiers 

1. .496 

/organism=" Pneumocystis carinii" 
/db_xref= n taxon:4754" 
1. .496 
/gene="MSG" 
1. .496 
/gene="MSG" 
/note="major surface 
214 a 50 c 89 g 



glycoprotein" 
143 t 



// 



caactatcaa gcttaataaa ggacccaata taggagtata ttgttaatat ttatgctata 
61 tattgggatc cagaaataat atgaaatgct tttttccaaa attaataaaa ttatagttct 
121 aagaatatta attttaattg ttcctactta tggcaatcaa agcacacatt tgaaagccag 
181 agattctcaa gtaagcgcag caaatactcc gtttaaggat attgttgtta aggatgaaga 
241 aatccttgct tatattttga gagaagatta taaagataaa tgcgaactaa agcttgagga 
301 atattgtaaa gaattgaggg atatagatca aggattaaat aaagttcata ctatagttaa 
361 agaaatttgt gatgatgaaa aacgagacgg aaaatgcaaa gaactgaaag acaaagttaa 
421 aaaggaattg gaaactttta aagaggaact tgaaaaagca ttgaaagaca taaaagatga 
481 aaattgtgaa aaatat 
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